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Abstract. We consider compound as well as arbitrarily varying 
classical-quantum channel models. For classical-quantum compound 
channels, we give an elementary proof of the direct part of the coding 
theorem. A weak converse under average error criterion to this statement 
is also established. We use this result together with the robustification 
and elimination technique developed by Ahlswede in order to give an 
alternative proof of the direct part of the coding theorem for a finite 
classical-quantum arbitrarily varying channels with the criterion of suc- 
cess being average error probability. Moreover we provide a proof of the 
strong converse to the random coding capacity in this setting. 
The notion of symmetrizability for the maximal error probability is de- 
fined and it is shown to be both necessary and sufficient for the capac- 
ity for message transmission with maximal error probability criterion to 
equal zero. 

Finally, it is shown that the connection between zero-error capacity and 
certain arbitrarily varying channels is, just like in the case of quantum 
channels, only partially valid for classical-quantum channels. 

1 Introduction 

Channel uncertainty is omnipresent and mostly unavoidable in real-world appli- 
cations and one of the major technological challenges is the design of commu- 
nication protocols that are robust against it. The incarnation of that challenge 
on the theoretical side delivers a plethora of interesting structural and method- 
ological problems for Information Theory. Despite these facts it happened only 
recently that this range of problems received the necessary attention in Quan- 
tum Information Theory and especially in Quantum Shannon Theory [7], [15] . 
[S] , [5J. In this paper we revisit two basic models for communication under 
channel uncertainty, the compound and arbitrarily varying channels with classi- 
cal input and quantum output and give essentially self-contained derivations of 
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coding theorems for them. These results were originally obtained in [7] and [3]. 
The contributions of the paper and the difference to existing work are the follow- 
ing. First, in [9J a capacity result with strong converse for compound channels 
with a classical input and quantum output (compound cq-channel for short) un- 
der the maximum error criterion has been derived. However, the achievability 
proof given there lacks transparency and does not show that good codes with 
the uniformly bounded exponentially decreasing maximal error exist. Indeed, in 
[PJ it is merely shown that good codes exist with uniformly super-polynomially 
decreasing maximal error probability. Here we prove that sharper result for the 
average error criterion and, at the same time, give a significantly simpler proof 
of the achievability part of the coding theorem based on a universal hypothesis 
testing result which is a generalization of the technique developed by Hayashi 
and Ogawa in |25j . The passage to the maximal error criterion can be carried 
out via a standard argument which can be found in [9]. 

It is interesting to compare this result with related work of Hayashi [5T] and 
Datta and Hsieh [17] . The works [3T] and [TTJ aim at showing the existence 
of codes depending on the input distribution and a prescribed rate only and 
achieving an exponential but channel dependent decay of error probability for 
all cq-channels whose Holevo information is strictly larger than that prescribed 
rate. The good codes in our approach depend on the input distribution and the 
set of cq-channels generating the compound cq-channcl. Additionally we obtain 
a uniform exponential bound on error probabilities, a property that seems highly 
desirable in case that the channel is unknown. 

Moreover, we prove the weak converse to the coding theorem under average er- 
ror criterion by a reduction to the strong converse for the maximal error via a 
lemma of Ahlswede and Wolfowitz from [3] . 

Second, once we have the achievability result for compound cq-channels we can 
obtain the corresponding results for arbitrarily varying cq-channels (AVcqC) in 
a straight-forward fashion via Ahlswede's powerful elimination [J] and robustifi- 
cation [5] techniques. This way, we obtain an alternative approach to the coding 
theorem for AVcqCs which was originally proven by Ahlswede and Blinovsky in 

m- 

Finally, we show that a naive quantum analog of Ahlswede's beautiful relation 
[3] between Shannon's zero-error capacity [2 7) and the capacity of arbitrarily 
varying channels subject to maximal error criterion does hold neither for AVc- 
qCs when employing the maximal error criterion nor for the strong subspace 
transmission over arbitrarily varying quantum channels. The latter communica- 
tion scenario is widely acknowledged as a fully quantum counterpart to message 
transmission subject to the maximal error criterion. 

2 Notation and Conventions 

All Hilbert spaces are assumed to have finite dimension and are over the field 
C. The set of linear operators from TL to H is denoted B(H). The adjoint of 
b G B(H) is marked by a star and written b* . The notation (-, -)hs is reserved 
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for the Hilbert-Schmidt inner product on B(1~L). 

S(H) is the set of states, i.e. positive semi-definite operators with trace 1 acting 
on the Hilbcrt space %. Pure states are given by projections onto one-dimensional 
subspaccs. A vector i £ H of unit length spanning such a subspacc will there- 
fore be referred to as a state vector, the corresponding state will be written 
|x)(x|. For a finite set X the notation ^P(X) is reserved for the set of probability 
distributions on X, and |X| denotes its cardinality. For any 1 6 N, we define 
X' := {(xi, . . . , x;) : Xi G S Vi G {1, . . . , I}}, we also write x l for the elements 
of X ; . For any natural number N, we define [N] to be the shortcut for the set 
{1,...,N} 

The set of classical-quantum channels (cq-channcls) mapping a finite alphabet 
X to a Hilbcrt space H is denoted CQ(X.,H). Since CQ(X, H) is the set of 
functions W : X — > It is naturally equipped with the norm |[ • || cg (which 

is inherited from the usual one-norm || ■ j|i on operators) and is defined by 

\\W\\ cg := max||W(a;)||i (W G CQ(X,H)). 

It is common, to embed the set ^P(X) of probability distributions into 2?(C' X '), 
i.e. to fix an orthonormal basis {e x } X £x in C' x ' and assign to every p in *P(X) an 
element of Z?(C' X ') which is diagonal in this basis. For a channel W G CQ(X, %) 
and a given input probability distribution p G ^3(X) one defines the correspond- 
ing state on C' x ' <g> % by 

p:=^2p(x)\e x )(e x \<E)W(x). (I) 

The set of measurements with N E N different outcomes is written 
M N (H) := {(D 1 ,...,D N ) : ££i A < t n and A > Vi € [N]}. To 
every (D±, . . . ,Dn) g M.n{H) there corresponds a unique operator defined by 

The von Neumann entropy of a state p G 5(H) is given by 

S(p):=-tr(p\ogp), (2) 

where log(-) denotes the base two logarithm which is used throughout the paper 
(accordingly, cxp(-) is reserved for the base two exponential). For two states 
p. a G 5("H), the quantum relative entropy is defined by 

n , I, s ftr(plogp- plogtr) ifkero-Ckerp 

D (p\W) : = < . , (3) 

I +oo else. 

The Holevo information is for a given channel W G CQ(X-,W) and input prob- 
ability distribution p G CP(X) defined by 

X (p, W) := S(W) - P(x)S(W(x)) = ^ p(x)D(W(x)\\W), (4) 
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where W is defined by W := ~Yl, x( z-^p{x)W(x). This quantity is concave w.r.t. 
the input probability distribution and convex w.r.t. the channel. Its concavity 
property follows directly from the concavity of the von Neumann entropy, its 
convexity in the channel is by joint convexity of the quantum relative entropy. 
For an arbitrary set W C CQ(X.,H) we denote its convex hull by conv(W) (for 
the definition of the convex hull, [55] is a useful reference). In fact, for a set 
W := {W 8 } ses 

conv(W) = J W q 6 CQ(X,H) : W q = ^ q(s)W s , q G «P(S), |supp(g)| < oo I , 
I ses J 

(5) 

because of Caratheodory's Theorem. 



3 Definitions 

3.1 The compound classical-quantum channel 

Let W C CQ(~K,H). The mcmorylcss compound cq-channel associated with W 
is given by the family {W 8I }(gn,n'6W- With slight abuse of notation it will be 
denoted W or, if necessary, 'the compound cq-channel W for short. In the 
remainder, using arbitrary index sets T, we will often write W = {Wt}ter 
to enhance readability. Before we continue, let us put a brief remark in order 
to explain why this subsection contains no definition of random codes (while 
subsection 13.21 does) : 

Remark 1. We abstain from defining random codes for compound cq-channels, 
the reason for this being that they do offer no increase in capacity. For the reader 
interested in the topic, we briefly outline one way of arriving at this conclusion. 
First, the capacity of compound channels, seen as a function from the power set of 
the set of channels with given input and output systems to the reals, is continuous 
(this can fact can be proven by an argument very similar to the one given for 
compound quantum channels in Sect. 8 of together with continuity of the 
single channel classical capacity, cf. [23]). This allows for an arbitrarily good 
(speaking in terms of their capacity) approximation of infinite compound cq- 
channels by finite ones, so that we can restrict our discussion to finite compound 
cq-channels. 

Second, given such a finite compound cq-channel and a sequence of 

random codes which achieve a given rate r with asymptotically vanishing average 
error, we may simply use it for the memoryless cq-channel W := Ster 
Since the average error is a convex function of the channel, this implies the 
existence of a sequence of deterministic codes at the same asymptotic rate with 
vanishing average error for W. 

Using affinity of the average error criterion once more, we see that the very 
same sequence of deterministic codes also has vanishing average error for the 
cq-compound channel {Wt}t^T, only with a slightly slower convergence. As in 
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the definition of W, the assumption that \T\ < oo holds is crucial at this point 
of the argument. This shows that random codes cannot have higher asymptotic 
rates than deterministic ones, if one insists on asymptotically vanishing average 
error. 

For the maximal error criterion, it is enough to note that both the random and 
the deterministic capacity for transmission of messages over a compound cq- 
channel using that criterion are upper bounded by the respective capacities for 
the average error criterion. 

Definition 1. An (/,M;)-code for message transmission over a compound cq- 
channel W C C<5(X,'H) is a family {x l m , D l m )^ l =1 , where x{, . . . ,x l M € X' and 
(D[,...,D l Ui )€M Ml (H® 1 ). 

Definition 2. For A g [0, 1) , a non-negative number R is called a A-achievable 
rate for transmission of messages over the compound cq-channel W = {Wt}teT 
using the average error criterion if there is a sequence {(u l m , D l m )^ l =1 }i e fq of 
(/, Mi)-codes with 

lim inf — log Mi > R and 

l— s-oo / 

-, Mi 

hmsup sup — V tT{W®\u l m ){t H m - D l J) < A. 

Z^oo teT M f—' 
^ m— 1 

Definition 3. For A G [0, 1), a non-negative number R is called a A-achievable 
rate for transmission of messages over the compound cq-channel VV = {T^tjter 
using the maximal error criterion if there is a sequence {(u l m , D l m )^' =1 }i^fi of 
(I, Mi) -codes with 

lim inf — log M; > R and 

l— >oo / 

limsup sup max tr(Wf l (u l m )(l n s>i - D l m )) < A. 

i-s-oc teT me[Afj] 

Definition 4. For A € [0, 1), the A-capacity for message transmission using the 
average error criterion of a compound cq-channel W is given by 

R is a X-achievable rate for 
Cc(W, A) := sup ^ R : transmission of messages over W > . (6) 

using the average error probability criterion \ 

The number Cc(W,0) is called the weak capacity for message transmission 
using the average error criterion ofW and abbreviated Cc(W). 

Definition 5. For A G [0, 1), the A-capacity for message transmission using the 
maximal error criterion of a compound cq-channel VV is given by 

R is a X-achievable rate for transmission 
Cc(W, A) := sup ^ R : of messages over W > . (7) 

using the maximal error probability criterion I 
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The number Cc (W, 0) is called the weak capacity for message transmission using 
the maximal error criterion 0/ W and abbreviated Cc(W). 

3.2 The arbitrarily varying classical-quantum channel 

Let A C CQ(X,'H). In the remainder we will write A = {A s } se s, where S 
denotes an index set, in order to enhance readability. We also set 



The arbitrarily varying classical-quantum channel associated with A is given 
by the family {A s i jvgs'.ieN- Again, with slight abuse of notation it will be 
denoted A or, if necessary, 'the AVcqC A for short. 

In this work, we will always consider the set S to be finite. Generalizations of 
our results to the case of arbitrary sets can be done by standard techniques (see 
[()])■ We will now define random codes and the random capacity emerging from 
them. In order to do so, we have to clarify a few things. 

A code for an AVcqC A will, for some choice of I, N <G N, be given by a 
probability measure in on the set ((X') w x Mn{H® 1 ), Si), where Si is a 
suitably chosen sigma-algebra. 

It has to be taken care that a function / defined by 
((x[,...,x l N ),(D[,...,D l N )) i-> mhv eS1 ^ Eili tr{W s i (x|)D|} is measur- 
able w.r.t. Si. Also, in order to define deterministic codes later, Si has to 
contain all the singleton sets. In the remainder, we shall assume that such a 
choice is always made. 

An explicit example of such a sigma-algebra is given by the Borcl 
sigma-algebra defined using the topology induced by the metric 



({x,D),(x',D')) ^ (! - 6{x,x')) + \\D - D'\\ 2 where 5(x,x) = 1 Vz € X 



and equal to zero else, and for sake of simplicity, we set I = N = 1. Finally, we 
note that the function / mentioned above is continuous w.r.t. to that metric. 
In the following definitions, let A 6 [0, 1). 

Definition 6. An (I, Mi) -random code for message transmission over A = 
{A s } se s is a probability measure [ii on ((X l ) Ml x A / Jat( - H <8 '), Si). In order to 
shorten our notation, we write elements of (X l ) Ml x M.n{T~L® 1 ) in the form 



Definition 7. An (I, M;)-deterministic code for message transmission over A = 
{A S } S £S is given by a random code for message transmission over A with \ii 
assigning probability one to a singleton set. 

Definition 8. A non-negative number R is called A- achievable for transmission 
of messages over the AVcqC A = {A s } sg s with random codes using the average 
error criterion if there is a sequence (///)jgn °f {I, Mi) -random codes such that 
the following two lines are true: 



A s , :=®\ =1 A H . 



(8) 



(4«r. 



liminf -logM( > R 



(9) 
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r 1 Mi 

limsupmax / — £tr (A s i (x[)(l n ®, - D[)) d^i{{u\, D\)^ x ) < A. (10) 

l— too s £S J ±V±i . 

%— 1 

Definition 9. ^4 non-negative number R is called A- achievable for transmission 
of messages over the AVcqC A = {A s } sg s with deterministic codes using the 
average error criterion if it is X-achievable with random codes by a sequence 
(/■*z)zeN which are deterministic codes. 

Definition 10. The A-capacity for message transmission using random codes 
and the average error criterion of an AVcqC A is given by 

{R is a X-achievable rate for transmission of "] 
R : messages over A with random codes > . (11) 

using the average error probability criterion \ 

The number Ca,t(A, 0) is called the weak capacity for message transmission us- 
ing random codes and the average error criterion of A and abbreviated Ca,i{A). 

Definition 11. The A-capacity for message transmission using deterministic 
codes and the average error criterion of an AVcqC A is given by 

R is a X-achievable rate for transmission of} 
CA,d(-4) A) := sup ^ R : messages over A with deterministic codes > . (12) 
using the average error probability criterion J 

The number C x,d{A 1 0) is called the weak capacity for message transmission 
using deterministic codes and the average error criterion of A and abbreviated 
C AA {A). 

Definition 12. A non-negative number R is called A-achicvablc for transmis- 
sion of messages over the AVcqC A = {A s } sg s with deterministic codes using 
the maximal error probability criterion if there is a sequence of (/, Mi) -random 
codes with each \xi being a deterministic code such that the following two lines 
are true: 

liminf jlogM; > R (13) 



limsupmax max / tr (A s i (x\)(l n ®i - D-)) dm((u\, -D-)fi\) < A. (14) 

Definition 13. The A-capacity for message transmission using deterministic 
codes and the maximal error probability criterion of an AVcqC A is given by 

{R is a X-achievable rate for transmission } 
R : of messages over A with deterministic codes > . (15) 
using the maximal error probability criterion \ 

The number Caa(A, 0) is called the weak capacity for message transmission 
using deterministic codes and the maximal error criterion of A and abbreviated 
C Atd (A). 
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The following definition will turn out to be useful to decide whether a given 
AVcqC has nonzero capacity for transmission of messages using average error 
criterion and deterministic codes. 

Definition 14. Let A = {A s } seS C CQ(X,H) be an AVcqC. If, for every 
x, x' G X, we have 

conv({A s (x)} seS ) n conv({ A s (x')} seS ) ^ 0, (16) 

then A is called m-symmetrizablc. 



3.3 Zero-error capacity 

Definition 15. An (I, Mi) zero-error code for a stationary memoryless cq- 
channel defined by V G CQ(X.,TL) is given by a family (x\, D l A i ^ 1 , where 
x[,...,x l Mi e X' 'and (D{, . . . , D l Mi ) G M Ml (U® 1 ) satisfy tr(V® l (x\)Dl) = 1 
for every i G [M{\ . 

Definition 16. The zero-error capacity for message transmission over the cq- 
channel V € C<5(X,H) is given by 

Co(V) := lim -logmaxjM; : 3 (I, Mi) zero — error code for V}. (17) 



4 Main Results 



We now enlist the main results contained in this work. We will not state the 
results obtained in Subsection 16.31 These evolve around the relation between 
zero-error capacities and arbitrarily varying channels. They include both mes- 
sage transmission and entanglement transmission. Rather than stating a positive 
result, in this section we argue that certain straightforward quantum analogues 
of results that are valid in the classical theory do not hold. As always, this is 
a delicate task that involves much more than just embedding a commutative 
subalgcbra into a non-commutative one. We therefore encourage the reader to 
consider this last subsection as something that should be read separately and in 
one piece. 

Our first result is the following. 

Theorem 1 (cq Compound Coding Theorem). For every compound cq- 
channel W G CQ(X,"H) it holds 

C C (W)= max M x(p,W). (18) 

In subsection l6.ll an analogue of the Ahlswede dichotomy from [4] for arbitrarily 
varying classical-quantum channels will be derived. This statement has originally 
been obtained by Ahlswede and Blinovsky in [7]. The precise mathematical for- 
mulation reads as follows. 
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Theorem 2 (Ahlswede-Dichotomy for AVcqCs). Let A = {A s } seS C 
CQ(X,"H) be an AVcqC. Then 

1) C A ,M) =C C (conv(i)) (19) 

2) If C A , d (A) > 0, then C A , d (A) = C A , r (A). (20) 

Also, this section contains the following statement, which asserts, that every 
sequence of random codes whith error strictly smaller than 1 for all but finitely 
many blocklenghts will not achieve rates higher than the rightmost term in (fT9"|) . 

Theorem 3 (Strong converse). Let A := {yl s } se s be an AVcqC. For every 

A e [0, l) 

C Atr (A,\)<C c (conv(A)) (21) 

holds. 

Remark 2. The result can be gained for arbitrary (infinite) AVcqCs with only 
trivial modifications of the proof given below. 

In the next subsection 16.21 wc show that the capacity for message transmission 
over an AVcqC using deterministic codes and the maximal error probability 
criterion is zero if and only if the AVcqC is m — symmetrizable. 
This is an analog of Theorem 1]. It can be formulated as follows. 

Theorem 4. Let A = {A s } seS C CQ(X,H) be an AVcqC. Then C A ,d(A) is 
equal to zero if and only if A is m- symmetrizable. 

5 Compound cq-channels 

In this section, we consider compound cq-channels and give a rigourous proof 
for the achievability part of the coding theorem under the average error crite- 
rion together with a weak converse. The channel coding problem for compound 
cq-channels was treated, restricted to achievability, by Datta and Hsieh [T7] for 
a certain class of compound channels, and Hayashi [21] , In our proof, we exploit 
the close relationship between channel coding and hypothesis testing which was 
utilized by Hayashi and Nagaoka before. With focus set on the maximal er- 
ror criterion, the compound cq channel coding theorem was proven in [9] already 
where also a strong converse theorem was proven for this setting. 
For orientation of the reader we sketch the contents of this section. In Lemma [TJ 
we reduce the problem of finding good channel codes for a finite compound chan- 
nel to the problem of finding good hypothesis tests for certain quantum states 
generated by this channel. The existence of hypothesis tests with a performance 
sufficient for our purposes is shown in Lemma [5] In order to establish the coding 
theorem for arbitrary compound channels, we recall some approximation results 
in Lcmma|ni With these preparations, we are able to prove the direct part of the 
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coding theorem. Additionally, we give a proof of the weak converse (for which 
we utilize the strong converse result for the maximal error criterion given in [3] 
in Theorem [1]). A strong converse for coding under the average error criterion 
does not hold in general for compound cq-channels (for further information, see 
Remark gj . 

We consider a compound channel W := {W t }teT C CQ(X,H) where T is a fi- 
nite index set. We fix an orthonormal basis {e x } x£ x in C' x '. For W and a given 
input probability distribution p G *P(X) we define for every t G T states 

pt ■= ^ P(x)\e x )(e x \ W t (x), and d- t :=p®a t , (22) 

on C' x ' <g> "H, where p and at are defined by 

p := ^2p{x)\e x )(e x \, and a t := ^ p(x)W t (x). (23) 

With some abuse of notation, we use the letter p for the probability distribution 
as well as for the according quantum state defined above. Moreover, we define 
for every I G N states 

n--=^ Vl&flv * =pm ®W\^ afl (25) 

where v t : (C |x| <g> H)® 1 (C |x| )®' ® U® 1 is the ismorphism permuting the 
tensor factors. The next lemma is a variant of a result by Hayashi and Nagaoka 
in , which states that good hypothesis tests imply good message transmission 
codes for the average error criterion. Here it is formulated and proven for the 
states pi and r;. 

Lemma 1. Let W := {Wt}teT C CQ(X,'H) be a compound cq-channel with 
\T\ < oo, p € *P(X), and I G N. Let further pi, 77 be the states associated to 
W,p as defined in \2J$ and \25}). If for A G [0, 1], and a > exists a projection 
qi G £((Cl x l)^ (gJ-H® 7 ) which fulfills the conditions 

1- tr{q lPl ) > 1 — A 
2. tr(gjTi) < 2~' a ; 

£/ien /or any 7 with a > 7 > and M; := [2^ a_7 ^J there is an (I, M{)-code 
(i^DmeiAy with 



Pi: 

1 1 teT 



Mi 

Ter Mi 



± J2 MWf - ^)) < |T|(2A + 4 • 2"^) (26) 



m— 1 



The following operator inequality is a crucial ingredient in the proof of the lemma 
above, it was given in a more general form by Hayashi and Nagaoka in |20j . 
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Lemma 2. Let a,b G B(7i) be operators on % with < a < 1 and b > 0. TTiert 

1-H - (a + 6)"5 a ( a + j,)-T < 2{l n - a) + 46, (27) 
where denotes the generalized inverse. 

Proof. See Lemma 2 in [20]. □ 

Proof (of Lemma\f$. Let I £ N, (ft a projection such that the assumptions of 
the lemma are fulfilled, and 7 a number with < 7 < a. According to the 
assumptions, qi takes the form 



= l^'X^'l ® 5x' ; (28) 



where fei £ £(H® ; ) is a projection for every x l G X'. Set M; := L2 i(a_7) J, 
and let Ui, Um 1 be i.i.d. random variables with values in X', each distributed 
according to the /-fold product p® 1 of the given distribution p. We define a 
random operator 

I Mi \" i /Mi \~5 

An := f ^ J 9t/ m f 9c/ n J (29) 

for every m € [Mi] (we omit the superscript I here), where again generalized 
inverses are taken. The particular form of the decoding operators D\, Dmi in 
eq. (f2T)|) guarantees, that 

Mi 

X D m < Ik®' 

holds for every outcome of f 1, Ujf, , and therefore (U m , -D m )me[Mi] is a random 
code of size M;. The remaining task is to bound the expectation value of the 
average error of this random code. We introduce an abbreviation for the average 
of the channels in W by 

t=i 

The error probability of the random code is bounded as follows. By virtue of 
Lemma [2j 



E 



tr 



(w\u m )(l n ®i - -Dm))] <2 E [tr(wVm)(l«a« -«t/jj 

+ 4- X T&[tr(W l (U m )q Un )] (30) 



mG[Mi] 
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holds. The calculation of the expectation values on the r.h.s. of the above equa- 
tion is straightforward, we obtain for every m G [Mi] 



E[tr(W {U m )(l H9 , - q Um ))] = tr{ Pl {l n m - qi)), 



and, for n =/= m, 



E 



tr 



(w\u m )q Ur 



trinqi). 



(31) 



(32) 



Together with the assumptions of the lemma, eqns. (J5TJ and (|32[) imply 



E 



< 2A + 4 • M; • 2~' a 



< 2A + 4-2"' 7 



Because this error measure is an affine function of the channel we conclude, that 
there exists a cq-code {x\ n , D m ) m l =1 for W with average error bounded by 

, Mi 

— £ tT(Wf\x l J(l n ^ - D m )) < \T\(2X + 4 • 2-^) (33) 

m— 1 

for every i G T, which is what we aimed to prove. □ 

The next two lemmata contain facts which are important for later considerations. 
The first lemma presents a bound on the cardinality of the spectrum of operators 
on a tensor product space which are invariant under permutations of the tensor 
factors. The group Si of permutations on [I] is, on V.® 1 , represented by defining 
(with slight abuse of notation) for each a € Si the unitary operator a G B(7i,® ) 

a(vi ® ... ® vi) := iv-i(i) ® ... ® Wo--i(i). (34) 

for all product vectors Vi (8 ... ® G C z and linear extension to the whole space 

Lemma 3. Let Y G B{U® 1 ) (d := dimH > 2 J .satis/?/ ctF = Ya for every 
permutation a £ Si. Then 

|spec(F)| < (l + l) d \ (35) 

Proof. It is clear that, under the action of Si, H® 1 decomposes into a finite 
direct sum W® 1 = where the Hij are irreducible subspaces of 

Si, mi G N their multiplicity and M G N. Moreover, Hi,j ~ Hi,k f.a. i G [M], 
j, fc G [mi] and to every such choice of indices there exists a linear operator 
Qi,j,k ■ T-LiM ^ such that gO, , ■,. = 6>..,>< T f.a. a € Si. 

Let us write Y = J2i,j Y t,m, 3 ,n, where ii, m ,i,n : ^ %i,m- Then according 
to Schur's lemma, Y i<tn j >n = 0, (i ^ j) and ii, m ,i,„ = Ci^ n Qi, m ,n for all valid 
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choices of indices and unique complex numbers Ci <m ,n £ C. 

Thus, defining the self-adjoint operators Yj := X)m\i=i c i,m,nQi,m,n, we see that 

M 

Y = J2 Y > ( 36 ) 

i=l 

holds. Obviously, li, m ,i lOT = m . Thus, with an appropriate choice of bases in 
every single one of the H.i^ m and defining the matrices C, by {Ci) mn := Cj lJrl)n , 
we can write a matrix representation Y^ of Yj as Y^ = Ci ® ljjdimc« 4i i). 
Clearly then, each of the Y^ can have no more than m, different eigenvalues. 
Since supp(Yi) _L supp(Yy) (i ^ j), we get 

M 

|spec(Y)| < ^m 4 . (37) 

i=l 

Now, taking a look at [13], equation (1.22), we see that < (I + l) d / 2 holds. 
The number M is the number of different Young tableaux occuring in the repre- 
sentation of Si on a.® 1 and obeys the bound M < NT([d} 1 ), where ATy ([<£]') is the 
number of different types on [dj l , that itself obeys Nt([(I\ 1 ) < (I + l) d (Lemma 
2.2 in [13]). For d > 2 we thus have 

M 

I s P cc(Y)| < ^rrii < {I + l) d2/2 (/ + l) d < (i + l) d2 . (38) 

i=l 

□ 

Lemma [3] provides the result which will, together with Lemma [T] imply the 
existence of optimal codes for W. We give a proof which is based on an idea of 
Ogawa and Hayashi which originally appeared in |25l . An important ingrcdience 
of their proof is the operator inequality stated in the following lemma. 

Lemma 4 ([19]). Let x be a state on on a Hilbert space K,, and A4 := 
{Pk\k=i be a collection of projections on JC with Pk — 1jc- Then 

the operator inequality 

K 

X<K-Y^PkXPk (39) 

k=l 

holds. 

Lemma 5. For every S > 0, finite compound cq-channel W := {Wt}t 6 T C 
CQ(X, %) andp € ^P(X) there exists a constant c, such that for every sufficiently 
large I € N there exists a projection q L s € B((C^) & ® H® 1 ) which fulfills 



1- H%SPl) > 1 - |T| ■ 2~ ,s , and 
2. tr(q ltS Ti) < 2- , (°-*) 
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where pi, 77 are the states belonging to W,p according to \2J$ and h25\) . and a is 
defined by a := min te [ T ] D(p t \ \p <8> a t ). 

Proof. Let 6 > be fixed, for 1 £ N, we have ran(p;) C ran(-n) := Hi, which 
allows us to restrict ourselves to Hi, where r; is invertible. For every e <E (0, 1), 
we define a regularized version pi >£ to pi by 

pi. £ := (1 -e)pi +sti. (40) 

These operators are invertible on T-Li and approximate pi, i.e. 

\\pi,e-pih<2e. (41) 

holds for every e > 0. We also define an operator 

Pl,e~ E E m,eE X , (42) 

Aespcc(T()\{0} 

which is the pinching of pi_ e to the eigenspaces of ti (here E\ is the projection 
which projects onto the eigenspace belonging to the eigenvalue A for every A e 
spec(-n)). This definition guarantees 

t iPl e = Pl e ti. (43) 

With a as assumed in the lemma, we define the operator 

T e :^p^-2 l ^Ti (44) 

with spectral decomposition 

T £ = (45) 

/i£Espcc(T £ ) 

The projection qu onto the nonnegative part of T £ , defined by 

)i&pec{T e ):fi>0 

will now be shown to suffice the bounds stated in the lemma. Clearly, qi t sT e q^s 
is a positive semidefinite operator, therefore, with (|44[) the inequality 

qLsnqLs < 2 
is valid. Taking traces in (|Tf)) yields 

to(qi,sTi) < 2 
< 2 

which shows, that qi t $ fulfills the second bound in the lemma. We shall now 
prove, that q^s for I large enough actually also suffices the first one. To this end 



^ a -Vqup, e qu. 



- lia - S) tT(Ql,SPl„ 
-l{a-S) 



(47) 



(48) 
(49) 
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we derive an upper bound on tr((l — qi,s)pi,e) for any given e > 0, which implies 
(together with (jH])) a bound on tr((l — qi,s)pi)- In fact it is sufficient to find 
an upper bound on tr((l — qi t s)pi e ), which can be seen as follows. Because /5; e 
and 77 commute by construction (see eq. (1431) ). T £ and 77 commute as well. This 
in turn implies that qi : $ commutes with the operators Ei, E\ spec r Tl \\ in the 
spectral decomposition of 77 which eventually ensures us, that 

ti((l n ®i - qi.s)p Le ) = ti((l n m - qi.s)pi,e) (50) 
holds. For an arbitrary but fixed number s € [0, 1] we have 

tr((l n m - qi, S )pi t£ ) = tx{fif ~ s) pl s {t n ^i - qi,g)) (51) 
<2- l '(°-Qtr$ 1 -') 7 f(l H9l - %6 )) (52) 
< 2- ls ( a - s hr(p { l 1 - s) T l s ). (53) 

The inequality in (j52[) is justified by the following argument. Since p e ; and 77 
commute, they are both diagonal in the same orthonormal basis {ffi}f =1 , i.e. 
they have spectral decompositions of the form 

d d 

Pi,e =^2xi\9i)(9i\, and n = ^ j 6 i \g i ){g i \. (54) 

i=l i=l 

Because qi.s projects onto the eigenspaces corresponding to nonnegative eigen- 
values of T e , we have 

- Qi,s = ^2 \9i)(9i\, ( 55 ) 

where the set N is defined by N := {i g [rf] : Xi - 2 l( - a ~ s) 6i < 0}. It follows 

Xt < 2 ls( - a ~ s) e s t (56) 
for all i e N and s e [0, 1]. This in turn implies, via (g3J and ([5*4]) . 

^, e (l«.. - qi,s) < 2' s ( a - 5 )rf(l H « ! - q LS ), (57) 
which shows (jS"2")) . Combining cqns. (|5U|) and ([53")) we obtain 

tr((l« 9 , - «B,j)p M ) < ^-'hvCp^rn 

= 2 , '<°-^trfa )e T*/£r*) 

= 2 l ^- s hx{p l , e rf-piX). (58) 

Here we used the fact, that p ; e and 77 commute in the first equality. Eq. (|58[) is 
justified, because the eigenprojections of 77 wich appear in the definition of p ; E 
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are absorbed by t ; 2 . We can further upper bound the above expressions in the 
following way. Note, that 

Pi,e < |spec(T/)|p, e . (59) 

holds by Lemma 2] Because — (-) _s is an operator monotone function for every 
s e [0, 1] (see e.g. [8]), d59j) implies 

Pie < |spec(rO| s ^ E s . 
Using the above relation, one obtains 

tripi^p-ST?) < |spec(r i )| s tr(p i!£ r^p- 6 s r / f ). 
By combination with (|58p this leads to 

tr((l w8 i - qi,s)pi,e) < I Bpec(7))|'2''(°- 4 )tr(p I , e T*/»j; e 'r,*) (60) 

< (I + if cxp{Z[(a - S)s - \4nAs)}} (61) 
= exp{Z[(a - S)s - ±^ >£ (s) + w(l)}}, (62) 
where d := dim"H. In (I61II . we used the definition 

in,M ■■= - to&toip^r} pitf), (63) 

in the last line we introduced the function w defined by w(l) := 4- log(^ + 1) for 
every / <E N. Notice, that we also used the bound | spec(r;)| < (I + if on the 
spectrum of t; which is justified by Lemma [21 In fact, by observation of (|25D . it 
is easy to see, that for every a in the tensor product representation of Si on W® 1 
(see ||3D), 

(lg ; x| ®o)n =n(lf^ ®a) (64) 

holds. We will now show, that the argument of the exponential in (|52"j) becomes 
strictly negative for a suitable choice of s, sufficiently small e and large enough 
I. We define 

fi, e (s) := (a - S)s ~ j1>U*)- (65) 

By the mean value theorem it suffices to show that // e (0) < for small enough 
e > 0. For the derivative, we have 

fU0) = a-S-jD{p, te \\n). (66) 
The relative entropy term in (|6"6"]l can be lower bounded as follows. It holds 

D(pi,e\\Ti) = -S(pi, e ) - tr(p iiE logT;) 

= -S( Pl , £ ) + lS(p) + S (j^ £ of 1 J (67) 

>-S(p lte )+lS(p) + ^-J2 lS ^- ( 68 ) 
' ' ter 
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Notice that the equality in (|rJ7|) indeed holds, because the marginals on (C' x ')®' 
and H.® 1 of pi and 17 are equal and therefore equal to the marginals of /0/ j£ by 
definition for each e € (0, 1). The inequality in ((68|) is valid due to concavity of 
the von Neumann entropy. Because (|4Tj) holds, 



<S(pi) + 2dlog^ (69) 

is valid for e < ^ since for two states p,a € with ||p — < £ < -, 
Fannes' inequality [15] , 

|5(p)-S(tr)| < £ log^^, (70) 



is valid. Together with (|69|) . (|68| implies 

£>(pJ, c |h) > S(pi) - 2el\og± + lS(p) + ±-J2 lS (<rt) 

>-^E W - l0 § l T l - 2d lo S Ye + IS{P) + W\^ lS{at) 
I I teT I I tgT 

(71) 

= ii|£ £>(P*lb ® *t) - log |T| - 2d log ±. (72) 
' ' teT 

The inequality in (|7ip results from the fact, that the von Neumann entropy is 
an almost convex function, i.e. 

JV 

S(p)<J2PiS(Pi)+log(N) (73) 

for any mixture p = Yli=iPiPi °f states. Inserting ([72]) in (|66|) gives 

// i£ (0) < min |p ® tr t ) - 6 - ^- D(p t \ \p ® a t ) + 2e log ^ + - log |T| 

' ' teT 

<_£ + ! login, (74) 



provided that < e < £q (5) where Eq is small enough to ensure 2e log ^ < 



uugn lu tins uie alt lug 2e ^- ^ 

The mean value theorem shows that for s 6 (0, 1] 

//,e(s) = /i,e(0) + ^ £ (s')-S 

holds for some s' e (0, s). Since fi, e (0) = 0, (J73J) shows that we can guarantee 
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for small enough s. By (p^l) and (|75[) we obtain for e < 6q(6) and I large enough 
to make w(l) < ^ valid, 

tr((l w ®i - qi,s)pi.e) < exp{Z[/;, E (s) + «;(/)]} 

<exp|-Z^ S - W (0)} (76) 
< |T|-cxp{-/^}. 

Using dH]), we have (with e < £o) 

tr((l K ®i - gi,5)Pi) < ||/0j, e - Pilli + tr((l w »i - qi,s)pi,e) 

< 2e + |T|cxp{-Z-s}. 

8 

We can in fact, choose the parameter e dependent on I in a way that (£;)z^i 
decreases exponentially in Z, which proves the second claim of the lemma. □ 

In order to prove the direct part of the coding theorem for general sets of channels 
we have to approximate arbitrary sets of channels by finite ones. For a > 0, an 
a-net in CQ(X, U) is a finite set N a := {W,}^ C CQ(X, H) with the property, 
that for every channel W <G CQ(X,H) there exists an index i £ [N a ] such that 

\\W-Wi\\ cq <a (77) 

holds. For a given set W C CQ(X,H) an a-net N a in CQ(X 7 'H) generates an 
approximating set W Q defined by 

W a := {Wi £ M a : B cq (a, W t )nW^ 0}. (78) 

where B cq (a,A) denotes the a-ball with center A regarding the norm || ■ \\ cg . 
The above definition does not guarantee, that W a is a subset of W but each W a 
clearly generates a set W2a C W of at most the same cardinality, such that for 
every W £ W exists an index i £ [N a ] with 

\\W-Wi\\ cq <2a. (79) 

The next lemma states that we find good approximations of arbitrary compound 
cq-channels among such sets as defined above. The proof can be given by minor 
variations of the corresponding results in [9] , [10] , and we omit it here. 

Lemma 6. Let W := {W t }teT C CQ(X,H) and a £ (0, \). There exists a set 
T a C T which fulfills the following conditions 

1. \T a \ < (i ) 2|X|d2 , 

2. given any I £ N, to every t £ T one finds an index t' £ T a such that 

\\W®\x l ) - VF®V)lli < 2/a. (80) 
holds for every x l £ X 1 . Moreover, 



Arbitrarily varying & compound classical-quantum channels. 



19 



3. for every p € ^J(X), 



min x(p, W t > ) - inf x(p, W t ] 



<2alog^- (81) 



holds. 



The following lemma is from [2] and will be used to establish the weak converse 
in Theorem [T] It states that codes which have small average error probability 
for a finite compound cq-channcl contain subcodes with good maximal error 
probability of not substantially smaller size. 

Lemma 7 (cf. [2], Lemma 1). Let W : {Wt]teT C CQ(X.,W) be a compound 
channel with \T\ < oo and I G N. If (u\, D\) i ^ 1 is an (I, M{)-code with 



, Mi 



i=i 



Then there exists for every e > a subcode {u\., of size Mi <e — Ly^M;J 

with 

max max tT(W? l (u[.)(l H -£>•.)) < \T\(\ + e) (83) 

Finally, we have gathered all the prerequisites to prove Theorem [TJ 

Proof (of Theorem]^). The direct part (i.e. the assertion that the r.h.s. lower- 
bounds the l.h.s. in (|18[1 ) is proven by combining Lemma [T] with Lemma [3J Let 
p = argmaXp, g fp( X ) initeT x{p' : W t ). We show that for any 6 > 0, 

M X (p,W t )-6 (84) 

is an achievable rate. We can restrict ourselves to the case, where 
inf t6 T x(p> W t ) > 8 > holds, because otherwise the above statement is trivially 
fulfilled. The above mentioned lemmata consider finite sets of channels, therefore 
we choose an approximating set W ai (of cardinality T ai ) according to Lemma|S] 
for every I £ N, where we leave the sequence cvi,a2, ... initially unspecified. For 
every I g N and t' € T ai . let p t > , cr t ' be defined according to cq. (|2"2"j) and (12H)) , 
and further define states 

Pi ■= vFl E v iP>*i ( 85 ) 

and 

^:=^®p^-| E (86) 

1 Qil t'ST Q! 
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For a given number 77 with < rj < a;, Lemma [5] guarantees (for large enough I), 
with a suitable constant c > 0, the existence of a projection qi tV G /3((C' X ')® Z (8) 
with 

tr(gi^Pl) > l-|T ai |-2-' £ (87) 

and 

tr(®,,7l) < 2-'( a '-") (88) 

where we defined a; := vaxn.t>£T a D (pt'\\p 65 ov). This by virtue of Lemma 
[1] implies for every 7 > such that r\ + 7 < a; the existence of a cq-code 
(i^Dmeliy of size 

M ; = [2 l( - a '- r '-^ \ (89) 

and average error bounded by 

» ^E tr (^''( u ™)( 1 ^ ! - jD ™))^ 2 l T ^| 22 ^ + 4 -| T ^l 2 "' 7 - ( 9 °) 

t 1 1 a, 

1 m— 1 

Notice, that for other positive numbers 7, 6, trivial codes have Mi = 1 > 
|2Ko!-'?-7)j. Using ([ggj) we bt a in, 

y log M( > pin x(p, W t >)-r)-'y (91) 

> mfx(p,Wt) -r? -7- 4a; log (92) 

where the second inequality follows from Lemma |51 For the average error, it 
holds, 

S ^Wt ^ tr W(«m)(l«»»-- D m)) (93) 



tGT 



me [Mi] 



^ ™ x ^ E tr - £>Jj) + 2Za, (94) 

°' 1 me [AT,] 

< 2\T ai \ 2 2- l£ + 4|T Q; \2~* + 2l m . (95) 

The first of the above inequalities follows from Lemma |B1 the second one is by 
(|9H|) . Because we chose the approximating sets according to Lemma O 

g \ 2|X|rf 2 



|T Qi |<^-j (96) 

holds. In fact, if we specify ai to be ai :— 2~ lc for every I € N, where c is a 
constant with < c < min i ^pfrgg , ^Ixfd 7 } > tnG r - n - s °^ ®3) decreases expo- 
nentially for I — > 00. If we additionally choose 77 and 7, small enough to validate 
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5 > i] + 7 + 2a; log for sufficiently large i, the rate defined in ([54")) is shown 
to be achievable by and (|9"2")l . Since 5 was arbitrary, the direct statement 
follows. 

It remains to prove the converse statement. For the proof, we will construct a 
good code for transmission under the maximal error criterion and invoke the 
strong converse result given in [3] (see Remark [3]). We show, that for any 5 > 0, 



C C (W) < max inf x (p, W t ) 



(97) 



Let 6 > and assume that for some fixed I € N, C; := (u l m , D l m )^ l =1 is an 
(l,Mi)-code with 



SU P TT E tr W(«4)(lw«« " Dl m)) ^ A < 



m— 1 



(98) 



We always can find a finite subset T C T such that 



max infyfp, Wt)— max minx(p, Wt) 
P e<P(x)teT P e<p(x) te f 



S 

< - 
~ 2 



(99) 



holds (e.g. a set T a as in LcmmalHlfor suitable a). We set e := — jjr-. By virtue 

2|T| 

of Lemma [7] we find a subcode (u\. , £)'.)*2j C C/ of C; which has size 



M Le := 



■Mi 



and maximal error bounded by 

« tr (Wf'(^.)(l 



max max 

t 6 f j'eAf,, 



D 



< A«|T| + -. 



(100) 

(101) 
(102) 



If I is sufficiently large, the r.h.s. is strictly smaller than one. Therefore, by the 
strong converse theorem for coding under the maximal error criterion (see [S], 
Theorem 5.13), we have (with some constant K > 0) 



- log Mi e < max mm\(p, W t ) + K— 

I pGV(X) te f VI 

max inf v(p, Wt) + ^ + K-^=. 

~ P e'p(x)teT A '^' 2 VI 



(103) 
(104) 



The second line above follows from (j99|) . On the other hand, by (|100p . we have 



log M < log M;, 



loe 



2(l-e) J ■ 



(105) 
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Dividing both sides of (|105[) by I and combinig the result with (|104p shows that 
for sufficiently large I 

) log Mi < p max mf X <p, W t)+ 5 - + + ) log (^L_) (106) 
< max Mx(v,W t ) + S (107) 

holds, which shows (^7]) . Since S was an arbitrary positive number, we are done. 

□ 

Remark 3. While the achievability part for cq-compound channels regarding the 
maximal error criterion given in [9] required technical effort, the strong converse 
proof was rather uncomplicated. It was given there by a combination of Wol- 
fowitz' proof technique for the strong converse in case of classical compound 
channels and a lemma from |29| . 

Remark 4- We remark here, that a general strong converse does not hold for the 
capacity of compound cq-channels if the average error is considered as criterion 
for reliability of the message transmission. This can be seen by a counterexample 
given by Ahlswede in [T] (Example 1) regarding classical compound channels. 
However, we will see in the proof of Theorem [31 that in certain situations (espe- 
cially, where W is a convex set) a strong converse proof can be established. 

As a corollary to the achievability part of Theorem Q] above, we immediately 
obtain a direct coding theorem for the capacity of a finite cq-compound channel 
under the maximal error criterion. 

Corollary 1. For a finite compound cq-channel W := {Wt}teT C CQ(X.,H) 
we have 

C C (W)> max imnx(p,W t ) (108) 
P e*p(x) ter 

Proof. For an arbitrary number S > 0, we show, that 

max min y(p, Wt) — S (109) 
p6<P(X) ter 

is an achievable rate. Let {C;}/ £ n, Ci '■= {u l m , D^^^L^il € N, be a sequence of 
(I, Mj)-codes with 

lim inf — log Mi > max miny(p, Wt) — (HO) 

;->oo l p€ < P(X) t€T 



and 

"ief Mi 



, Mi 

E tr W(«m)(ltt«« ~ D l J) < \i (111) 



rn—1 
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for every I £ N, where lirm^oo A; = 0. Such codes exist by virtue of Theorem [TJ 
Because of Lemma[7J we find for each I G NasubcodeC; := (u l m . , D l m .) i£ ^^ C Ci 

of size Mi := [ ^ Mi\ and maximal error 

max max tr (W® l (u l mi )(l n ®i - D l rn J) < (A; + e t )\T\. (112) 

with the sequence (ez)^ defined by e\ := 2~ ( 3 f.a. I £ N, it is clear that we find 
a sequence of (Z,Mi)-subcodes {Cj}, eN , where C, := (u l m ., D l f.a. I e N, 
which fulfills 

lim max max tr (W t & (u l m J(l n ^ - D l „J) = (113) 

and 

liminf- log M; = liminf - logM; > max minx(p, Wt) — 5. (H4) 

l^oo I Z-i-oo / peq3(X) tgT 

□ 

Remark 5. The above corollary, although proven here for finite sets, can be ex- 
tended to arbitrary compound sets by approximation arguments, as carried out 
in [5J. Moreover, an inspection of the proofs in this section shows that the speed 
of convergence of the errors remains exponential. 

6 AVCQC 

6.1 The Ahlswede-Dichotomy for AVcqCs 

In this section, we prove Theorem [5] and Theorem [31 The proof of Theorem [2] 
is carried out via robustification of codes for a suitably chosen compound cq- 
channel. More specifically, to a given AVcqC A we take a sequence of codes for 
the compound channel W := conv(yl) that operates close to the capacity of W. 
Thanks to Theorem [TJ we know that there exist codes for W that, additionally, 
have an exponentially fast decrease of average error probability. The robustifi- 
cation technique then produces a sequence of random codes for A that have a 
discrete, but super-cxponcntially large support and, again, an exponentially fast 
decrease of average error probability. 

An intermediate result here is the (tight) lower bound on C A,r{A). 
A variant of the elimination technique of [3] is proven that is adapted to AVcqCs 
and reduces the amount of randomness from super-exponential to polynomial, 
while slowing down the speed of convergence of the average error probability 
from exponential to polynomial at the same time. 

Then, under the assumption that Ca.<i{A) > holds, the sender can send the 
required amount of subexponentially many messages in order to establish suf- 
ficiently much common randomness. After that, sender and receiver simply use 
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the random code for A. 

We now start out on our predescribed way. The following Theorcm[S]and Lemma 
[8] will be put to good use, but are far from being new so we simply state them 
without proof. 

Let, for each I £ N, Perm; denote the set of permutations acting on {1, . . . , I}. 
Let us further suppose that we are given a finite set S. We use the natural action 
of Perm; on S l given by a : S l — > S l , a(s l )i := s a -i^y 

Let T(l, S) denote the set of types on S induced by the elements of S , i.e. the set 
of empirical distributions on S generated by sequences in S l . Then Ahlswede's 
robustification can be stated as follows. 

Theorem 5 (Robustification technique, cf. Theorem 6 in [5] ). 

Let S be a set with |S| < oo and I £ N. If a function f : S l — > [0, 1] satisfies 

f(s l )q(s 1 )-...-q(s l )>l- 1 (115) 

s'es' 

for all q £ T(l, S) and some 7 £ [0, 1], then 

I £ /(*(*')) >l-(i + l)l s l. 7 Vs'eS'. (116) 

(TEEPcrni; 

The original theorem can, together with its proof, be found in [5]. A proof of 
Theorem [S] can be found in [5] . The following Lemma is borrowed from [1] . 

Lemma 8. Let A'eN and real numbers a\, . . . , ok, 61, ... , bx £ [0, 1] be given. 
Assume that 

1 K 1 K 

— 2^ ai >l-£ and ^£ &i - 1_£ ' 

i=i i=i 

hold. Then 

1 K 

— J2aA>l-2e. (118) 

i=l 

We now come to the promised application of the robustification technique to 
AVcqCs. 

Lemma 9. Let A = {^4 s } se s be an AVcqC. For every r\ > there is a sequence 
of (I, Mi)-codes for the compound channel W := conv(y^l) and an Iq £ N such 
that the following two statements are true. 

liminf - logMt > C C (W) - r] (119) 

I— >oo I 

1 1 Ml 

TOTi E w ^toM„- 1 &))„-\i%))>i-(i+i)m.2- l ° vi>i 

s l eS l 11 * — 4 Mi * — ' 
crGPermj i— 1 

(120) 

with a positive number c = c(|X|, dim %, A, rj). 



Arbitrarily varying & compound classical-quantum channels. 



25 



Remark 6. The above result can be gained for arbitrary, non-finite sets S as well. 
A central idea then is the approximation of conv(^4) from the outside by a convex 
polytopc. Since CQ(X.,H) is not a polytope itself (except for trivial cases), an 
additional step consists of applying a depolarizing channel Af p and approximate 
Af p (conv(A)), a set which does not touch the boundary of CQ(K,H), instead of 
conv(„4). 

This step can then be absorbed into the measurement operators, i.e. one uses 
operators Af£(D\) instead of the original D\ (i = 1, . . . , Mi). 
A thorough application of this idea can be found in [6] , where the robustification 
technique gets applied in the case of entanglement transmission over arbitrarily 
varying quantum channels. 

Proof. According to Lemma [TJ there is a sequence of (I, Mi) codes for the com- 
pound channel conv(^) = {W q : W q = J2 s es Q( s ) A s, 1 G *P(S)} fulfilling 

lim inf - log Mi > C c (conv (A)) - r) (121) 

and 

, Mi 

3l eN: M —J2HW® l (x>i)Dl)> l-2~ lc Vl>l . (122) 

Weconv(A) Ml * — ' 
i— 1 

The idea is to apply Theorem [SJ Let us, for the moment, fix an N 3 I > lo and 
define a function fi : S l — > [0, 1] by 



fi^)--=j F J2tr(AA^Dl). (123) 

1 i=l 

Then for every q G ^J(S) we have 

l Mi 

E fi{s l )j[q{si) = w Y, t < w T( xl i) Dl i)^ l - 2 ~ lc - ( 124 ) 

s ! GS' i=l 1 i=l 

It follows from Theorem [51 that 

l_ (i + l)|S|. 2 -Ic< 1 £ /i(a(si)) (125) 

i i Ml 

77 E ttE 41 ^'^ 1 ^))^ 1 ^)) Vs ' eS ' 



Z! ^ M z 

crGPcrm; 2—1 



(126) 



holds, where 

a{B l ®...®Bi)-~B ry -i (1) ®...®B^i ( i ) VBi,...,Bj € B(W) (127) 

defines, by linear extension, the usual representation of Perm; on B(H)® 1 and 
the action of Perm; on X' is analogous to that on S l . □ 
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It is easily seen from the above Lemma [S] and Theorem [TJ that the following 
theorem holds. 

Theorem 6. For every AVcqC A, 

C Ar (A) >C c {coiw(A)) = max inf x(P,A). (128) 

pe<P(X) Aecom(A) 

In the following we give a proof of the remaining inequality in (|19[) . In fact, we 
prove the stronger statement Theorem [3J 

Proof (of Theorem^). We define W := conv(_4). Since |S| is finite, this set is 
compact. The function x('i ') is a concave-convex function (see eq. (U])), therefore 
by the Minimax Theorem, 

max min v(p, W) — min max y(p, W) (129) 
pe^(x) w<=w wew P e<p(X) 

holds. Both sides of the equality are well defined, because we are dealing with a 
compact set. Let an arbitrary W q <G W be given by the formula 

W q = ^q(a)A., (130) 

where q G ?p(X). Set, for every Z € N, q® l (s l ) := n'=i <?( s 0- Lct A e [°> 1), * > 
and (fii)i^ be a sequence of (Z, M;)-random codes such that both 

liminf - log Mt = C A ,r(A,X) - S (131) 

l— too l 

and 

, M, 

liminf min — EMA s! (^)^((^,^)^i) > 1 - A. (132) 

i—1 

For every Z G N it holds that 

I 5>(Wf dw((t4,^i) (133) 

i— 1 

= E <^V) / $>(A.,(i4)£>i) d W ((t4,^)<='i) (134) 
s'es' ^ i=i 

> min / E W^.' d«((«i,-Di)<='i)> (135) 



which shows, that 

M 



U , minf / TrE tr W( u i) 2? *) dw((«i,-Di)^i) > 1 - A 



(136) 
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discrete memoryless cq-channel W q satisfying 



holds. It follows the existence of a sequence (i4,-D')ieN of (Z,M;)-codes for the 



lim inf - log Mi = CaAA A) - S and (137) 

I— >oo [ 

1 Mi 

liminf TT £ tr W' («')£>') > 1 - A. (138) 

i=l 

By virtue of the strong converse theorem for single cq-DMCs given in [35] (also 
to be found and independently obtained in [25]), for any A <E [0,1), S > it 
follows 

C A , r (A, A) - 5 = lim inf - logM; (139) 

< max (p,W q ) (140) 
pe«p(x) 

and, since W q £ W was arbitrary, 

C A r(A,X)-S < min max X (p, W) (141) 
ivew P eq3(x) 

= max min v(p, W). (142) 
P e<p(X) wew 

The equality in (|142l) holds by (|129l) . Since 6 was an arbitrary positive number, 
we are done. □ 

The following lemma contains the essence of the derandomization procedure. 

Lemma 10 (Random Code Reduction). Let A = {A s } se s be an AVcqC, 
I £ N ; (ii an (I, Mi) random code for A and 1 > ei > with 

r 1 M ' 

e^uA) := inf / — ^tr(A^)^M(4,^)^i) > (143) 
s'es' J Mi -f—' 

2 — 1 

Lei n,m e E. TTien if 4^ < l~ m and 2 log |S| < p"™" 1 i/iere e^si Z™ (Z.M;)- 
deterministic codes {x\ j, ■ ■ ■ , j, D[ j, ■ ■ ■ , D l M[ ■) (1 < j < Z™j /or „4 sitc/i 

-n £ m £ tr (^< K,)^) ^ 1 - rm ^ e s '- ( 144 ) 

i=l i=l 

Proof. Set e := 2l~ m . By the assumptions of the lemma we have 

e( W ,.4) := min / — ]T tr(A si (^)Z><)d W ((4 D[)f^) > 1-e,. (145) 
s ! eS ! J Mi ^ 

2 — 1 

For a fixed K 6 N, consider if independent random variables /li with values in 
((X') M <) x Mm^U® 1 )) which are distributed according to m. 
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Define, for each s l G S*, the function Ps , : ((X') Mi ) x M Ml {U m )) -> [0, 1] 

(x[, . . . , x l Mi ,D{,...,D l Mi )^^ Y^i tr(A si (x?)D\). 

We get, by application of Markovs inequality, for every s l G S': 



1 K 

»(1 - - V}) sl (l) > ell) = p(2 Jf -Si=iP.«(^) > 2 K£ / 2 ) (146) 



K 

j'=i 



< 2"- R " £ / 2 E(2 (Ar " E f=i p = ,(Aj)) ). (147) 



The yli arc independent and it holds 2* < 1 + t for every t £ [0, 1] as well as 
log(l + ei) < 2ei and so we get 

P(l --Vp^yl,) > e/2) < 2- K£ / 2 E(2 K -^=i p s liA J ) ) (148) 



K 

3=1 



= 2- Jf8 / a E(2^i( 1 -P.'(^))) (149) 

= 2 -Jf e /2 E ( 2 (l-p 3! (A 1 ))^ ( 15Q j 

<2- K£ / 2 E(l + (l- Ps ,(^i))) A ' (151) 

< 2 -if6/2( 1+ej )JC (152) 

< 2 -^/ 2 2 X£ / 4 (153) 

= 2- Ke / i . (154) 



Therefore, 



P(lf> sl (^.) > 1 - e/2) > 1 - |S| Z 2-^/ 4 . (155) 

i=i 

By assumption, 2 log | S | < ^C™-™- 1 ) an d thus the above probability is larger 
than zero, so there exists a realization A\, . . . , Ain such that 

^E^ tr (W si (^)>l-^- (156) 



2=1 



□ 



Now we pass to the proof of Theorem [5J If CA, r (A) = or Ca,<i(A) = there is 
nothing to prove. So, let C A,r{A) > and Ca.ci(A) > 0. Then we know that, to 
every I <E N, there exists a deterministic code for A that, for sake of simplicity, 
is denoted by (x\, . . . , x\ 2 , D\, . . . , Dp), such that 

i 2 

7" — 1 

and 6/ \ 0. Also, by Lemma [HI to every e > there is a sequence (^ m )meN 
of random codes for transmission of messages over A using the average error 
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probability criterion and an mo <E N such that 



lim inf — log M m > C A r (A) -e (158) 

m— >oo 771 



/ E tr(A sm (^)D].)dM m ((^, ...,« m) 4..., ^J) > 1 - 2_mC 

(159) 

for all m > mo with a suitably chosen (and possibly very small) c > 0. This 
enables us to define the following sequence of codes: Out of the random code, 
by application of Lemma [TU] and for a suitably chosen mi > mo such that the 
preliminaries of Lemma [10] arc fulfilled, we get for every m > mi a discrete 
random code supported only on the set {(j/ij, • ■ • , UM m ■ ■ ■ > EM m ,j)}JLi 

such that 

lim inf - log M m > C A . r (A) - e (160) 

m— >oo [ 

1 m 2 1 M m 

—2 E jt E tr (^(C;)^') ^ 1 - ^ Vm ^ ( 161 ) 

3=1 m i=l 



Now all we have to do is combine the two codes: For l,m £ N, define an (I 

Mr 

,0 



a ^ )-deterministic code with the doubly-indexed message set {«, ij-'^^TLi 



by the following sequence: 

((si,l#),.Di® Eja^. (162) 
For the average success probability, by Lemma [5] it then holds 



V M„ 

1 

mm 



m ^-EE tr (V,")(^U«))^ ® Eii) > 1 - 2max{ £; , I}. 

i=l j— 1 

(163) 

Now let there be sequences (k)teN and (m t )teN such that Z t = o(l) and 
Z t + m t = t f.a. t e N. Define a sequence of (t, — )-deterministic codes 

(x\ , . . . , x% M , £>i , . . . , Dp M ) for A by applying, for each t € N, the above 
described procedure with m = m t and Z = Z t . Then 

lim inf - log Z? M m . > R and (164) 



6.2 M-Symmetrizability 

In this section, we prove Theorem 2) 
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Proof. We adapt the strategy of [22], that has already been success- 
fully used in [§]. Assume A is m-symmctrizable. Let I £ N. Take 
any a 1 , b l £ X ( . Then there exist corresponding probability distributions 
p(-\ai), . . . ,p(-\ai),p(-\bi), . . . ,p(-\bi) € *P(S) such that the probability distri- 
butions p(-\a l ),p(-\b l ) e qj(S') defined by p(s l \a l ) := llLiPOik), pW) : = 
IlLi P( s i| & satisfy 

E P^VM-^') = E K^K'(&') (166) 
s'es' s'es 1 

and thereby lead, for every two measurement operators D a ,Db > satisfying 
Z) a + Z?b < , to the following inequality: 

E p(*V)tr(4.'(a')A,) = E P(s l \b l MA s i(b l )D a ) (167) 
s'es' s'es' 

< E P(s l \b l )tr(A s , (b l )(l H ®, - £> 6 )) (168) 

s'es' 

= 1- E p(s l \b l )tt(A st (b l )D b ). (169) 
s'es' 

Let a sequence of (I, Mi) codes for message transmission over A us- 
ing the maximal error probability criterion satisfying M; > 2 and 
mmjgrjvfji min s i g g! tr(j4 s i {x\)DA = 1 — Ei be given, where e; \ 0. Then from 
the above inequality we get 

l-ei<l-(l-e,) ej>l/2. (170) 

Therefore, CU,tz(v4) = has to hold. 

Now, assume that A is not m-symmetrizable. Then there are x,y € X such that 
conv({A s (x)} a£a ) n conv({A(y)}ses) = 0- (171) 
The rest of the proof is identical to that in [B] with I set to one. □ 



6.3 Relation to the zero-error capacity 

A remarkable feature of classical arbitrarily varying channels is their connec- 
tion to the zero-error capacity of (classical) d.m.c.s, which was established by 
Ahlswede in [3] Theorem 3]. 

We shall first give a reformulation of Ahlswede's original result and then consider 
two straightforward generalizations of it result, one for cq-channels, the other 
for quantum channels. In both cases it is shown, that no such straightforward 
generalization is possible. 
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Ahlswede's original result. Ahlswcdc's result can be formulated using the fol- 
lowing notation. For two finite sets A, B, C(A, B) stands for the set of channels 
from A to B, i.e. each element of W € C(A,B) defines a set of output proba- 
bility distributions {Ty(-|a)} a£ A- With slight abuse of notation, for each DcB 
and a <G A, W(D\a) := J2beD W(b\a). The (finite) set of extremal points of the 
(convex) set C(A, B) will be written E(A, B). 

For two channels W U W 2 € C(A,B), their product Wi ® W 2 € C(A 2 ,B 2 ) is 
defined through W\ (8 W2{b 2 \a 2 ) := W / i(&i|ai)VF 2 (&2|a2). An arbitrarily varying 
channel (AVC) is, in this setting, defined through a set W = {W^jses C C(A, B) 
(we assume S and, hence, |W|, to be finite). The different realizations of the 
channel are written 

W s i := W S1 <g> . . . <g> W Sl (s l eS l ) (172) 

and, formally, the AVC W consists of the set {W s i} s i^s l , leN- 
An (I, M;)-codc for the AVC W is given by a set {a\}^ C A 1 called the 'code- 
words' and a set {D l i }ff} 1 of subsets of B' called the 'decoding sets', that satisfies 

A nonnegative number R G K is called an achievable maximal-error rate for the 
AVC W, if there exists a sequence of (I, Mi) codes for W such that both 

liminf -log Mi > R and lim min min W s i (-Dilxj) = 1. (173) 

(->oo { I— too s l GS ! l<i<Mi 

The (deterministic) maximal error capacity C max (W) of the AVC W is, as usu- 
ally, defined as the supremum over all achievable maximal-error rates for W. 
Much stronger requirements concerning the quality of codes can be made. An 
(I, M;)-code is said to have zero error for the AVC W, if for all 1 < i < Mi and 
s l G S l the equality W s i{Dl\x\) = 1 holds. 
The zero error capacity Co(W) of the AVC W is defined as 

C (W) := lim max{\ log Mj : 3 (I, M/)-codc with zero error for W}. (174) 

l— > oo I 

The above definitions carry over to single channels W G C(A,B) by identifying 
W with the set {W}. 

In short form, the connection [3J Theorem 3] between the capacity of certain 
arbitrarily varying channels and the zero-error capacity of stationary memoryless 
channels can now be reformulated as follows: 

Theorem 7. Let W G C(A, B) have a decomposition W = X) s es l( s )W s , where 
{W s } seS C E(A, B) and q(s) > Vs G S. Then for the AVC W := {W s } seS - 

C (W) = C max (W). (175) 

Conversely, for every AVC W = {W s } s6 s C S(A, B) and every q G ^P(S) with 
q(s) > Vs G S, equation Ji7<5[ ) holds for the channel W := J2 S £S<l( s )Ws- 
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Remark 7. Let us note at this point, that the original formulation of the theorem 
did not make reference to extremal points of the set of channels, but rather used 
the equivalent notion "channels of — 1-type". 

Remark 8. By choosing W € E(A, B), one gets the equality C (W) = C max (W). 
The quantity C maK (W) being well-known and easily computable, it may seem 
that Theorem [7] solves Shannons's zero-error problem. This is not the case, as 
one can verify by looking at the famous pentagon channel that was introduced 
in [571 Figure 2.]. The pentagon channel is far from being extremal. That its 
zero-error capacity is positive [UJ is due to the fact that it is not a member of 
the relative interior riE(A, B). 

Recently, in [3], this connection was investigated with a focus on entanglement 
and strong subspace transmission over arbitrarily varying quantum channels. 
The complete problem was left open, although partial results were obtained. 

A no-go result for cq-channels. We will show below that, even for message trans- 
mission over AVcqCs, there is (in general) no equality between the capacity 
C (WO of a channel W € CQ(X,H) and any AVcqC A = {A 3 } s&s constructed 
by choosing the set {A s } sg s to be a subset of the set of extremal points of 
CQ(X,H) such that 



holds for a A G ^P(S). Observe that the requirement that each A s (s G S) be 
extremal in CQ(X,'H) is a natural analog of the decomposition into channels of 
— 1-type that is used in the second part of [3j. 

A first hint why the above statement is true can be gained by looking at the 
method of proof used in [3J, especially equation (22) there. The fact that the 
decoding sets of a code for an arbitrarily varying channel as described in [3] have 
to be mutually disjoint, together with the perfect distinguishability of different 
non-equal outputs of the special channels that are used in the second part of 
this paper, is at the heart of the argumentation. 

The following lemma shows why, in our case, it is impossible to make a step that 
is comparable to that from [3J equation (21)] to [3J equation (22)]. 

Lemma 11. Let A = {A s } s£ s be an AVcqC with Ca,<i{A) > and < R < 
Ca,<i(A). To every sequence of (I, Mi) codes satisfying liminfj^oo ylogM; > R 
and limi_ s . 00 minjg^/;] m i n s'es' tr(A s i (x\)Dj) = 1 there is another sequence of 
(I, Mi) codes with modified decoding operators D\ such that 




(176) 



1) 



liminf -logMj > R 



(177) 




in mintr(A s( (x^)^) = 
Mi\ s'es 1 



(178) 



3) 



V * e [Mi], I G N, tr(A s i(x\)Dl) < 1 



(179) 
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Proof. Just use, for some c > 0, the transformation D\ := (1 — 2 lc )D\ + 

After this preliminary statement, we give an explicit example that shows where 
the construction in equation (|176|) must fail. 

Lemma 12. Let X = {1,2} and % = C 2 . Let {ei,e2} be the standard basis of 
H and V>+ := \flj^i e i + e 2)- Define W G CQ{X,H) by W(l) = |ei)(ei| and 
W(2) = \ip+)(ip+\- Then the following hold. 

1. W is extremal in CQ(X, W) 

2. For every set {AJ seS C CQpL,H) and every A G q3(S) such that JT75p 
Mis, {4 8 }. 6 s = {V^}. 

5. C (W) = 0, but C A , d {{W}) > 0. 

Proof. 1) Let, for an x G (0, 1) and W U W 2 G CQ(X,H), 

VK = .T^i + (l-a;)iy2- (180) 

Then, clearly, 

|ei)(ei| = xW^l) + (1 - x)W 2 (l) =► Wi(l) = W 2 (l) = W(l) (181) 
and 

|^+>(V»+|=a;Wi(2) + (l-a:)W3(2) Wi(2) = W 2 {2) = W(2), 

(182) 

S0VF = VKi = ^2. 

2) is equivalent to 1). 

3) It holds ti{W{i)W(j)} > 1/2 G X). Let I G N. Assume there are two 
codewords a 1 , b l G X' and corresponding decoding operations C,D > 0, C + D < 
Igi, such that 

tr{W®V)C} = t^W 8 "^)!)} = 1 
(=► tr{W® , (o')D} = tr{W 8l (&')C} = 0). (183) 

Then we may add a third operator £ := 1® 2 ' — C — D and it holds that 

tr{W®'(o')-B} = tr{iy 0/ (&')£} = 0. (184) 

From equations (|184|l and (|183p we deduce the following: 

= \^DW® l {a l )VD = VcW® l (b l )Vc = 0. (185) 
With these preparations at hand, we are led to the following chain of inequalities: 
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< tx{W® l {a l )W® l (b 1 )} (186) 
= tr{W 9l (C + D + E){a l )W m (b l ){C + D + E)} (187) 
= (CW®V)> W® l (b l )C) H s + (CW 0l {a l ),W® l (b l )D} HS 
+ (CW®V)> W® l {b l )E) HS + {DW m (a l ),W® l {b l )C) H s 
+ {DW m {a l ) 1 W® l {b l )D) HS + {DW m {a l ),W m {b l )E) HS 
+ (EW® l (a l ), W m {b l )C) HS + (EW® l (a l ), W m {b l )D) HS 
+ {EW®\a l ), W® l {b l )E) HS (188) 
= 0, (189) 



as can be seen from a repeated application of the Cauchy-Schwarz-inequality 
to every single one of the above terms and use of equation (|185[) . Thus, by 
contradiction, Cq(W) = has to hold. 

Now assume that the AVcqC {W} is m-symmetrizable. This is the case only if 

W(l) = W(2) (190) 
holds, which is clearly not the case. Thus, Ca.<i{{W}) > 0. □ 

A no-go result for quantum channels. We now formulate a straightforward ana- 
logue of Theorem[7]for quantum channels. To this end, let us introduce some no- 
tation. We heavily rely on [6] . The set of completely positive and trace-preserving 
maps from B{'H) to B(K.) (where both % and K, are finite-dimensional) is de- 
noted C(H,!C). For a Hilbert space H, S(T~L) denotes the set of vectors of unit 
lenght in it. 

An arbitrarily varying quantum channel (AVQC) is defined by any set 3 = 
{A/" s } seS C C(H,K.) and formally given by {Af s i} s 'es l ,ieK, where 

Af s i := Af Sl S3 ... ® Af st (s 1 e S 1 ). (191) 

Let 3 = {Arises be an AVQC. An (I, k{)— strong sub space transmission code for 
3 is a pair (V l ,TZ l ) £ C{F u n® 1 ) x C{K® 1 , J"/), where T u T[ arc Hilbert spaces 
and dim J 7 ; = fcj, Jj C T[. 

Definition 17. A non-negative number R is said to be an achievable strong 
subspace transmission rate for the AVQC 3 = {Af s }ses if there is a sequence of 
(I, ki)— strong subspace transmission codes such that 

1. liminf;_ i . 0O jlogfc; > R and 

2. Um^inf^gsimiiVes^^^^oAC. °^(|V>)(W) = 1- 

The random strong subspace transmission capacity -4 s ,random(3) of 3 is defined 
by 

.y. _ R is an achievable strong subspace\ (192) 

s.dcu ) ■ SU P S • i ransm i ss i on rate for 3 J ' 
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Sclf-cvidcntly, wc will also need a notion of zero-error capacity: 

Definition 18. An (l,k) zero-error quantum code (QC for short) (J 7 , V,1Z) for 
Af e C(H,K) consists of a Hilbert space T, V G C{F,H m ), K € C(K,®\F) with 
dim J 7 = k such that 

min {x,KoN® 1 oV{\x){x\)x) = 1. (193) 

x&F,\ \x\ | = 1 

The zero-error quantum capacity Qo(AP) of Af € C(H,IC) is now defined by 

Qo(Af) := lim — log maxjdim T : 3(1, k) zero-error QC for Af}. (194) 

I— >oo / 

Conjecture 1. Let Af <E C(H, Kf) have a decomposition Af = J2 s es li^-^s, where 
each 7V S is extremal in C(U,IC) and q(s) > Vs e S. Then for the AVQC 
J := {A/;} se s: 

Qo(Af) = A,det(3). (195) 

Conversely, for every AVQC 3 = {jV" s }seS with A/" s being extremal for every 
s6S and every q G *P(S) with q(s) > Vs G S, equation (|195[) holds for the 
channel Af := J2 seS q(s)Af s - 

Remark 9. One could formulate weaker conjectures than the one above. A crucial 
property of extremal classical channels that was used in the proof of Theorem [7] 
was that W s i(-|a;-) is a dirac-measure for every codeword x\, if only {W^ijses C 
E(A,B). 

This property gets lost for the extremal points of CifH, Kf) (see the channels that 
arc used in the proof of Theorem [5]), but could be regained by restriction to 
channels consisting of only one single Kraus operator. 

This conjecture leads us to the following theorem: 
Theorem 8. Conjecture^ is wrong. 

Remark 10. As indicated in Remark [9j there could still be interesting connec- 
tions between (for example) the deterministic strong subspacc transmission ca- 
pacity of AVQCs and the zero-error entanglement transmission of stationary 
memory less quantum channels. 

Proof. Let H = JC = C 2 . Let {eo,ei} be the standard basis of C 2 . Consider, 
for a fixed but arbitrary x e [0, 1] the channel Af x G C(H,JC) defined by Kraus 
operators A\ := \/\ — x 2 |eo)(ei| and A2 := |eo)(eo| + x|ei)(ei |. As was shown in 
|30j . this channel is extremal in C(H, JC). It is also readily seen from the definition 
of Kraus operators, that it approximates the identity channel id^2 £ C(fH,JC): 

lim \\Af x — zrf C 2 j| = 0. (196) 

X— >1 

Now, on the one hand, Af x being extremal implies span({A* A,} 2 = Af(C 2 ) 
for all x € [0, 1) (where M(C 2 ) denotes the set of complex 2x2 matrices) by 
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[T2l Theorem 5]. This carries over to the channels N® 1 for every I £ N: Let the 
Kraus operators of N® 1 be denoted {A i i} i i e ^ 12 y , then 

span({A*,Ajj} i i j ji e { lj2 }i) = {M : M is complex 2 ; x 2 i -matrix}. (197) 

On the other hand, it was observed e.g. in [TB], that for two pure states 
\4>){(j>\, G S((C 2 )® 1 ), the subspace spanned by them can be transmitted 

with zero error if and only if 

|V) (0| -L span({A* i A ji } t '. J 'e{i,2}0- (198) 

This is in obvious contradiction to equation (|197|) . therefore Qq(N x ) = Vi £ 
[0,1). 

On the other hand, from equation (|196[) and continuity of -4 s ,dct(') m the speci- 
fying channel set ([6], though indeed only the continuity results of [23] that were 
also crucial in the development of corresponding statements in |Hj are really 
needed here) we see that there is an X € [0, 1) such that for all x > X we have 
A sAct {{N x }) > 0. Letting i = Iwe obtain Q (Af x ) = and A S Act(Wx}) > 0, 
so Qo(Afx) 7^ As,det(J^x) in contradiction to the statement of the conjecture. 

□ 
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